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METHOD AND APPARATUS FOR PIPELINED 
PROCESSING OF DATA PACKETS. 

Field of the invention 

The present invention relates to data processing in general, and more particularly to a 
method and apparatus for pipelined processing of data. 

5 Baclcground 

A processor which uses a pipelined processing technique and which receives a stream of 
data to be operated upon by the processor can be divided into segments referred to as 
processing stages, each processing stage being capable of executing operations on data. 
The processing stages make up a processing pipeline. Several data packets may 

10 simultaneously be present in the processing pipeline, being operated upon by different 

processing stages, and being brought forward to the next processing stage in the pipeline as 
time flows. Each processing stage can execute operations on the data present in the 
processing stage. Upon each clock tick, data is passed onto the subsequent stage by loading 
registers that are located between the previous and the subsequent processing stages. The 

1 5 data thus becomes present in the subsequent stage. 

Summary 

A problem to which the present invention relates is how to improve the possibilities of 
utilising pipelined processing of data. 



20 This problem is addressed by a processing means for pipelined processing of data packets, 
said processing means comprising an input, an outpvrt and a pipeline comprising at least 
one processing stage. The pipeline is connected between the input and the output. The 
processing means further comprises at least one access point providing access to a device. 
Said device is connected to the access point via a request channel which comprises a 

25 transmit connection for transmitting requests to the device and a receive connection for 

receiving responses from the device. The access point comprises at least one FIFO store for 
storing data entering the access point, a response FIFO store connected to the device via the 
receive connection for storing responses received on the receive coimection, and a 
synchronisation mechanism adapted to synchronise the fetching of the jSrst entry in said at 

30 least one FIFO store and the first entry in said response FIFO store. 
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The problem is fmiher addressed by a method of pipelined processing of a data packet in a 
processing means comprising a pipeline. The pipeline comprises at least one processing 
stage. At least part of the data packet is received in an access point of the pipeline. The at 
5 least part of the data packet and any additional information associated with the data packet 
is stored in at least one FIFO store, A request is transmitted from the access point to a 
device on a transmit connection. A response, corresponding to said request, is then 
received in said access from the device on a receive connection and stored in a response 
FIFO store connected to said device via a receive connection. The first response in said 
10 first response FIFO store and the first entries in said at least one FIFO store are then 
extracted. The response is then merged into the data packet and/or into additional 
information associated with the data packet. 

By the mveadve method and processing means is achieved that data processing 
1 5 functionalities that are less suitable for integration in tiie processing stages can be provided 
by the processing means via devices that can be accessed_fiom the pipeline. Such 
operations could e.g. be operations that require ejqiensive hardware or software, space- 
consuming hardware, functionalities that are especially time-consuming, or operations that 
are seldom performed. The design of the processing stages can therefore be focused upon 
20 operations for which the processing stages are suitable, and the processing stages can be 
made cheaper, smaller and more efficient Furthermore, resources can be saved since a 
device may be used by more than one access point. The synchronisation mechanism assureis 
that a response received from a device is associated with the correct data packet. 

25 In one embodiment of the inventive processang means, an access point can provide 

simultaneous access to more than one device via more than one request chaimels. In the 
inventive method, more than one request can be transmitted simultaneously from the access 
point on more than one transmit connection, where each transmit connection is coimected 
to a different device. Hereby is achieved that the data packet, and/or the additional 

30 information associated with the data packet, can simultaneously be processed by different 
devices. 
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Iq one embodiment of the inventive processing meaxis> the access point further comprises 
means for merging relevant parts of said responses into said data packet and/or into said 
additional information. Hereby is achieved that a device can be used for processes which 
alter the data packet and/or the additional information. 

5 

In one embodiment of the inventive processing means, the synchronisation mechanism is a 
fixed time delay mechanism adapted to initiate a fixed time delay upon entry of at least part 
of a data packet nito the access point In tiie inventive method, a fixed time delay is 
initiated and said extraction of said first response and first entries is performed in 

10 responsive to the elapse of said fixed time delay. Hereby is achieved that tiie distribution of 
data packets may be maintained and that jitter may be avoided. In one aspect of this 
embodiment, the fixed time delay is equal to or longer than the tune required to process the 
most time consuming operation provided to access point by any of the devices. Hereby is 
achieved that the risk of the data block being further processed before a response has been 

15 received by firom the device(s) is eliminated. In another aspect of this embodiment, the 

fixed time d^lay mechanism is adapted to transmitting a triggering signal when the initiated 
fixed tune delay has elapsed. In the corresponding aspect of the inventive method, a 
triggering signal is generated when the fixed time delay has elapsed, and the step of 
extracting is performed responsive to said triggering signal. Hereby is achieved that the 

20 synchronisation of the extraction of the response, the data packet and additional 

information is facilitated. In another embodiment of the inventive processing means, the 
synchronisation mechanism is completion driven conditional logic adapted to determine 
whether all relevant response FIFO store(s) have data in the furst entry. 

25 In an ranbodiment of the inventive processing means wherein the access point provides 
simultaneous access to more than one device, each of the receive coimections of said 
dedicated request chaimels is preferably coimected to a dififerent response FIFO store. 
Hereby is achieved that the processing time of operations provided to an access point by 
the same device may vary between different operations without risking that the responses 

30 are mixed up. Furthermore, having different response FIFO stores connected to different 

devices facilitates for the number of operations invoked fi^om the same access point to vary 
between the processing of different data packets. Hence, all devices connected to an access 
point do not have to be used in the processing of each data packet. 
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In one aspect of ttie inventive procesang means, the transmit comiection is connected to an 
access unit of said access point. The access unit is adapted to receiving data processing 
information associated with said data packet, to using said data processing information in 
5 creating a request and to transmitting the request to said device on said transmit 

connection. Hereby is achieved that a request may be customised for a data packet In one 
aspect of Ifais embodiment, the data processing information comprises iirformation related 
to the handlmg of a response received jBfom the device(s), and the access unit comprises an 
. access unit FIFO store for storing information relating to handling of a response. Hereby is 

10 achieved that altering the response handling on a per packet basis is fecilitated. Suitably, 
the fixed time delay mechanism can be adapted to sendmg said triggering signal at least to 
the access unit, the access unit being adapted to fetching a response in the response FIFO 
store responsive to said triggering signal. Hereby is achieved that said access unit knows 
■when said re^onse is available for further processing. In an aspect of this embodiment 

1 5 wherein Ihe access point provides simultaneous access to more than one device, each 

transmit connection is preferably connected to an access unit. Hereby is achieved that one 
request can be customised for each data packet. 

In one embodiment of the inventive method, data processing information associated with 
20 tiie data packet is extracted prior to the transmitting of the request. The extinction of data 
processing information is suitably performed by use of an access point reference in the 
additional information. The request can then be prepared according to said data processing 
iofoimation, and the mer^boig of the response into the data packet and/or additional 
information can be performed accordmg to said data processing information. Hereby is 
25 achieved that a request may be customised for a data packet. 

In one aspect of this embodiment, the extraction of data processing information is 
performed by extiacting a driver identifier fiwm the access point reference and lookmg up 
said driver identifier in a driver table, the driver identifier being associated with data 
30 processing information in said driver table. The access point of the inventive processing 
means may comprise a driver table comprising data processing information entries and 
associated driver identifier entries, and the access pomt may be adapted to receiving a 
driver identifier and to extractmg, via said driver identifier, data processing information 
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ftom the driver table. Hereby is achieved that the processing resources in the processing 
stages that are used for obtaining the access point reference can be reduced. Furthennore, 
an abstraction layer is introduced to the access point, so that the values of data processing 
information can be altered without changing the operation of the processing stages. The 
5 data processing information corresponding to a value of the driver identifier in the driver 
■ table can be altered, e.g. during configuration of the access point Hence, the exchange of a 
device to a different device, or the change of operations for which a device can be used by 
access point, is &cilitated. 

10 In one embodunent of &e inventive processing means, the processmg means comprises at 
least one switch being connected between at least one transmit connection and at least two 
devices, said switch being configurable to provide access to any one of said devices via any 
of said transmit connections. Hereby is achieved that a transmit connection may provide 
access to different devices at different times. Furthermore, a device may easily be accessed 

IS fiom diff^^nt access points. 

Brief description of the drawings 

The present invoition will now be discussed in more detail with reference to preferred 
embodiments of Ae present invention, given only by way of example, and illustrated in the 
accompanying drawings, in which: 

20 

Fig. 1 is a schematic drawing of a procrasor operating according to a pipelined processing 
technique. 

Fig. 2 is a schematic drawing of an access point, providing access to a device firom a 
25 pipeline in a processor operating according to a pipelined processing technique. 

Fig. 3 is an illustration of an access point reference comprising data processing 
information. 

30 Fig. 4a is an illustration of the transmission of a request to a device fix>m an access point 
and the reception of a response &om the device. 
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Fig, 4b is an example of a request to a device. 

Fig. 4c is an example of a response from a device. 

5 Fig. 5a is aii example of a driver table comprising driver identifier entries and 
corresponding data processing information entries. 

Fig. 5b illustrates an access point reference comprising a driver identifier. 

1 0 Figs. 6a-e is an illustration of a data packet flowing through an access point 

Fig. 7 illustrates an example of an access point providing access to more than one device. 

Fig. 8 is a flowchart illustrating an example of a method of handling the access to a device. 

15 

Fig. 9 is a flQWchart illustrating a procedure performed by an embodiment of an access 
unit 

Detailed description 

In Fig. 1 , an example of a processor 100 operating according to a pipelined processing 
20 technique is illustrated. Processor 100 comprises an input 105, a pipeline 1 10 and an output 
1 15. Input 105 coxild e.g. be a receiver and output 115 could be a transmitter, or input 105 
and output 1 15 could be of simpler design. Pipeline 1 10 of Fig. 1 comprises two processing 
stages 120a and 120b. Obviously, a pipeline 110 may comprise any number of processing 
stages 120. When a data packet 125 enters processor 100, input 105 may, if the data packet 
25 125 is large or small compared to the available bitwidth in pipeline 110, re-arrange data 
packet 125 into one or more data blocks 130. A data block 1 30 may comprise one or 
several data packets 125, or parts of a data packet 125. Input 105 may fiirthermore 
associate additional information 135 to data packet 125. Additional information 135 may 
e.g. comprise infomiation about which instractions should be executed on data packet 125 
30 in pipeline 1 10 (see Swedish patent application 0100221-1, filed by the applicant and 
hereby incorporated by reference). Data block 130 and additional information 135 are 
passed fi-om input 105 to pipeline 110, in wMch the additional information 135, as well as 
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the data block 130, may or may not be operated upon. The processing stages 120a and 120b 
comprise logic units 145a and 145b, respectively, in which the operation on data is 
performed. A data block 1 30 can be operated upon by a logic unit 145 of a processing stage 
120 during the time that elapses between two clock ticks. Upon the next clock tick, data 
5 block 130 and additional information 135 are forwarded to the next processing stage 120, 
via a data block register 150 and an additional register 155. Additional register 1 55 could 
advantageously comprise a graieral purpose register 160 and a special purpose register 165. 
The general purpose register could preferably be used for storage of the part of additional 
information 135 that should be easily accessible, while special purpose register 165 could 
1 0 preferably be used for storage of the part of additional information 135 that is only used by 
certain parts of the pipeline 1 10. Each processing stage 120 should preferably have access 
to a data block register 150 and an additional register 155 for retrieving data that should be 
operated upon by processing stage 120, and another data block register 150 and additional 
register 155 for storing data that has been operated upon by processing stage 150. Two 
adjacent processing stages 120 hence share a data block register 1 50 and an additional 



register 155.. 

In pipeline 1 10, a data block 130 can be processed upon by a logic unit 145 of a processing 
stage 120 during one clock tick. In order to simplify the construction and programming of 

20 the processing stages 120, the processing stages 120 should preferably be identical and 
each processing stage 120 should preferably provide the same functionality. When 
designing processing stages 120 that are to be included in a pipeline 1 1 0, one should 
preferably consider which functionaUties are often used in pipeline 1 10, and, if feasible, 
include them m each processing stage 120. When the processor 100 is used in e.g. a router, 

25 such functionaUties may be forwarding, encapsulating and decapsulafing of data packets. 
To include functionaUties which are rarely used in each processing stage 120, or which 
require a lot of hardware and/or compUcated software, is often too expensive and/or space 
consuming. Examples of such functionaUties may in a router be packet filtering, trafBc 
conditioning and counting of packets. 



30 



A solution to the problem of how to include rarely used or hardware/soflware-consuming 
fimctionaUties in a processor 100 operating according to a pipelined processing technique 
is to connect one or more devices, external to pipeline 1 10, to one or more points in tiie 
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pipeline 110. Such a device coiild be a hard-coded device, a programmable device, a 
memory, or any other device that can be used in the processing of data, and xnay be referred 
to as a co-processor or an engine. The device can be part of processor 100, or be eKtemal to 
processor 100. A fiinctionality that is not included in processing stages 120 can then be 

5 included in one or several devices. Time-consuming fimctionalities can also suitably be 
introduced to a processor 100 via such devices. The queuing of data while a time- 
consuming operation is performed in a device could then e.g. be done in memories. If the 
time-consuming operation were to be carried out in the pipeline 110, a processing stage 
120 would have to be introduced for each instruction necessary for the performance of the 

10 operation. Furthermore, each processmg stage 120 would advantageously have to support 
the operation, thus requiring more hardware in each processing stage 120. Hence, it is often 
cheaper to use a device, external to the pipeline 1 10, for performing time consuming tasks. 

A point in pipeline 1 10 which has access to one or more devices will hereinafter be 
15 referred to as an access point Part or all of the additional information 135 and/or part or all 
of Ihe data bjock 130 can be processed in a device. Normally, the entire data block 130 and 
the entire additional mformation 135 will not flow through the device. Therefore, when 
access to a device is introduced to a pipelme 1 10, care has to be taken so that the result of 
the operation performed m the device is associated with the correct data block 130 when 
20 the result is returned to the pipeline 1 10. If no special action is taken, there is an obvious 
risk of mixing up the order of the results in relation to the order of the data blocks 130. 

According to the invention, the risk of mixing \sp the order of the results that are to be 
returned to the data blocks 130 in pipeline 110 from one or several devices is eliminated by 

25 introducing, m the access point. First In First Out (FIFO) stores at least for the storage of 
data that enters the access pomt, e.g. data block 130 and associated additional information 
135, and for the storage of responses received from the device. Furfliermore, a 
synchronisation mechanism is introduced in the access point, said synchronisation 
mechanism for the synchronisation of fetching the first entries in said FIFO stores, so that 

30 the first entry fetched in the FIFO store(s) for storing data that enters the access point is 

associated with the correct response fetched firom the FIFO store for storing responses from 
Hhs device. 
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According to one embodiment of the invention, the syndironisation mechanism is 
implemented as a fixed time delay mechanism. A fixed time delay can then be introduced 
for each data packet 125 that enters the access point. The fixed time delay can preferably be 
adapted to be equal to, or longer, than the time it takes for the processing of the most time- 

5 consuming bperation that is available in any of the devices that can be accessed from the 

access point When the fixed time delay has elapsed, the first entries of the FIFO stores can 
be fetched for fiirther processing. An advantage obtained by this embodiment of the 
synchronisation mechanisms is that the time used for the processing of a data packet 125 in 
an access point is the same for each data packet 125, so that the distribution of data packets 

10 125 may be maintained and jitter may be avoided. 

According to another embodunent of the mvention, the synchronisation mechanism could 
be implemented as completion driven conditional logic. When the condition of aU relevant 
FIFO stores having data in the first entry is fulfilled, the first entry in each of the relevant 

1 5 FIFO stores is fetched for fiffther processing. Since every packet 125 is not necessarily 
processed by any or all deuces avwlable, information regarding which FIFO stores are 
relevant for a certain data packet 125 could m this embodiment preferably be included in 
additional information 135 and used by the completion driven conditional logic. This 
embodiment can advantageously be used in implementations where the time used for 

20 processing of different data packets 125 does not show significant variations, or where 
jitter does not cause significant problems. 

In Fig. 2, one embodiment of the mvention is illustrated. In order to simpUfy the 
description, only one device 205 at a time can be accessed firom access pomt 200a of Fig. 2 
25 via a transmit connection 210, However, as will be seen later in relation to Fig. 7, other 

embodhnents of the invention can provide simultaneous access to several devices 205 firom 
an access point 200 by use of several transmit connections 210. 

hi Fig. 2, an access point 200a is Ulustrated between processing stage 120i and processing 
30 stage 120i+l , the access point 200a having access to one device 205 at a time, hi Fig. 2, 

four different devices 205i-iv are shown m order to illustrate the possibiUty of configuring 
access point 200a to provide access, via transmit connection 21 0, to either one of devices 
205i-iv, the configuration preferably taking place during the boot procedure of processor 
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100. Each of devices 205i-iv are coxmected to a switch 220 via connections 225i-iv. When 
access point 200a is configured, one of connections 225i-iv is connected to transnut 
connection 21 0 via switch 220. In Fig. 2, the connection 225iii is illustrated to be the 
connection which is connected to transmit connection 210, and hence, device 205iii is the 
5 device 205 which can at present be accessed via access point 200a. The transmit 

connection 210 is preferably fiarther coimected to an access unit 215 of access point 200a. 
The access unit 215 preferably comprises a request port for transmitting requests to device 
205 and a receive port for receiving responses from device 205. Access unit 215 is 
furthermore preferably adapted to preparing requests to be sent to device 205 and to 
10 ^an^Hng responses received from device 205. The functionalities of access unit 215 may be 
physically co-located, but may just as well be distributed amongst different locations in 
access point 200. 

Upon entry in access point 200a, data block 130 and associated additional information 135 

1 5 could suitably enter a request extraction entity 23 7, in which information relevant to the 

access of deyice 205iii is extracted from data block 130 and/or additional information 135, 
and then forwarded to access unit 2l 5, In one embodiment of die invention, such 
information relevant to the access of a device 205 is stored in the general purpose register 
160 of additional register 155 in a processing stage 120. Hence, the contents of general 

20 purpose register 120i, hereinafter referred to as general purpose register contents 230, is 
extracted in request extraction entity 237 and sent to access unit 215. Furthermore, an 
access point reference 240 may be extracted by request extraction entity 237 and forwarded 
to access imit 21 5. Access point reference 240 could advantageously be stored as part of 
the special purpose register 165 in processing stage 120i, and could preferably comprise 

25 data processing information relating to which operations should be performed by device 
205iii, and which data should be used in the operation performed by device 205iii. Data 
block 130 and additional information 135 enter data block FIFO store 245 and additional 
information FIFO store 250, respectively. Access unit 215 could advantageously comprise 
a FIFO store for storing data processing information relating to which actions should be 

30 taken when a response is received from the device 205iii, if any, said FIFO store 
hereinafter referred to as the access unit FIFO store. 
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In an alternative embodimaat, general purpose register contents 230 and access point 
reference 240 could be extracted from additional information 135 by processing stage 1201, 
prior to data block 130 entering access point 1 30, instead of being extracted by request 
extraction entity 237. Request extraction entity 237 could then be omitted. 

Access poiat reference 240 can preferably have been generated by any of the processing 
stages 120 which have operated upon data block 130 and/or additional information 135 
prior to data block 130 and additional information 135 enter access point 200a. An 
example of information that access point reference 240 can comprise is given in Fig. 3, 
wherein access point reference 240 is shown to comprise data processing information 300, 
comprismg data fields 302, 305, 3 10, 315, 320 and 325. Data field 302, heremafter referred 
to as operation code 302, represents the operation that should be requested of device 205iii. 
The nature of the operations that can be performed by a certain device 205 obviously 
depends on the nature of the device 205. For example, if a device 205 is a Content 
Addressable Memory (CAM), then ejcamples of requestable operations could be CAM 
-write, CAM.look-\q) or CAM read etc. For other types of devices 205, examples of 
requestable operations could be division, multiplication, counting of packets, or any otiier 
operation. Preferably, predetermined codes can be used for tiie representation of requested 
operations, so that operation code 302 comprises tiie code tiiat has been agreed to represent 
the requested operation. Preferably, one of Ihe codes could represent a no-operation- 
request, i.e. that no operation is requested. Data processing information 300 of Fig. 3 
further comprises a data field 305, hereinafter referred to as address location 305, 
representing a location in additional information 135 where an address to a memory 
location in device 205 is stored, a data field 310, hereinafter referred to as request data 
location 310. representing a location in additional information 135 where the request data 
is stored, and a data field 315, hereinafter referred to as response data location 315, 
representing a location in additional information 135 where the response data should be 
stored. Data processing information 300 may further comprise a data field 320 representing 
the request data size, referred to as request data size 320, as well as a data field 325 
representing the response data size, referred to as response data size 325. Memory location 
305 can preferably be omitted, should the device 205 not require such addressing. Memory 
location 305, request data location 310 and response data location 315 can advantageously 
point at locations in flie general purpose register content 230. Obviously, data processing 
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information 300 preferably comprises the data fields necessary for handling requests to a 
device 205, and the data processing information illustrated in Hg. 3 is only an example. 

Refemng now to Fig. 2, access xinit 215 determines, based on the data processing 
information 300 included in access point reference 240, v/hat data should be included in a 
request transmitted to device 205iii via transmit connection 210, and then transmits this 
information to device 205iiL Device 205iii processes the data, and returns a response to 
access point 200a via receive connection 255. Receive connection 255 is preferably 
connected between the device 205iii and a response FIFO store 260 in the access point 
200a- Response FIFO store 260 can thra be furflier connected to access unit 21 5. 

In order to synchronise the fetching of data from the FIFO stores, a synchronisation 
mechanism can advantageously be introduced to access point 200. In Fig. 2, the 
synchronisation mechanism is shovm to be a fixed time delay mechanism, the fixed time 
delay mechanism being capable of transmitting a triggering signal when a fixed period of 
time has elapsed since tiie fixed time delay mechanism was initiated. The fixed time delay 
mechanism could preferably be initiated when a data block 130 and associated additional 
information 135 enter access point 200a. The fixed time delay could advantageously be 
adjusted to the number of clock ticks needed in order to complete the most demanding 
operation that could be performed by any of the devices 205i-iv, so tiiat, when the fixed 
time of tiie fixed time delay mechanism has elapsed, the device 205 attaining to the request 
would have had time to process the request and deliver a response to response FIFO store 
260, regardless of which operation in devices 205i-iv was requested. The fixed time delay 
mechanism could advantageously comprise a shift register. Alternatively, the fixed time 
delay mechanism could comprise a counter, or a table comprising the data packets 125 that 
has entered access point 200a, or any other device that could produce a fixed time delay. In 
Fig. 2, tiie fixed time delay mechanism is shown to be a shift register 263. A triggering 
value 265, comprising one or several bits, is shown to have been shifted into the shift 
register 263 in Fig. 2 in order to initiate the fixed time delay mechanism. The size of shift 
re^ster 263 should advantageously be adjusted according to the number of clock ticks 
needed in order to complete the most demanding operation tiiat could be performed by any 
of devices 205i-iv, so that, e.g., the number of positions in shift register 263 equals the 
number of clock ticks needed to by access point 200a to handle the most demanding 
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operation. Upon each clock tick, triggering value 265 moves one position in shift register 
263. 

When the triggering value 265 has reached the end of shift register 263, the triggering 

5 value 265 could preferably be sent to access unit 2 1 5 as a triggering signal 268, the 

triggering signal 268 indicating that the response relatmg to the first entries m data block 
FIFO store 245 and additional information FIFO store 250 have arrived in response FIFO 
store 260. Provided that all requests received by device 205iii are processed in the 
sequence &at they arrive, the first response in response FIFO store 260 wUl correspond to 

10 the first data block 130 m data block FIFO store 245. Upon reception of the triggering 
signal 268. access unit 215 can preferably fetch the first response m response FIFO store 
260, and forward said first response to the response substitution entity 270. The response 
substitution entity 270 can preferably, upon receiving said first response, fetch the first 
entries in data block FIFO store 245 and additional mformation FIFO store 250, said first 

15 entries being the data block 130 and the additional information 135 to which said first 
response corresponds. In fee response substitution entity 270. said first response can 
substitute relevant parts of data block 130 and additional informaition 135. This substitution 
should preferably be performed according to the data processing mformation 300 relating 
to response handling, which could be fetched fix)m access unit 215 by response substitution 

20 entity 270. 

In an alternative embodiment, the triggering signal 268 is transmitted to response 
substitution entity 270 as well as to access unit 215. The response substitution entity 270 
would then perform the fetching of tiie first entries of data block FIFO store 245 and 

25 additional information FIFO store 250 upon recqption of triggering agnal 268. In another 
embodiment, the triggering signal 268 is transmitted only to response substitution unit 270, 
which could fetch the response, the data block 130 and the additional information 135 upon 
reception of tiie tiiggering signal 268. hi yet another embodiment, tiie fetchmg of the first 
entries in data block FIFO store 245 and additional mformation FIFO store 250 is 

30 performedby accessunit215uponTeceptionofthetriggeringsignal268. Access unit215 
would then forward said first entries togeflier witii the first entry in response FIFO store 
260 to response substitution entity 270. This embodiment is particularly suitable if only 
one device 205 can be accessed at a time by an access point 200. Response substitution 
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entity 270 could comprise access unit 215, or access unit 21 5 could be implemented as a 
separate entity. In implementations where several devices 205 may be accessed 
simultaneously from an access point 200, response substitution entity 270 could comprise 
an access units 215 per transmit connection 210, 

The triggeriiig value 265 could be shifted into shift register 263 by request extraction entity 
237 -when the data block 130 and additional information 125 enter data block FIFO store 
245 and additional infomiation FIFO store 250, or by the access unit 21 5 v^n a request is 
sent to device 205, or 1^ the processing stage 120i preceding access point 200, as shovm in 
Fig. 2. 

The transmission of data to/from device 205iu from/to access point 200a is further 
illustrated in Fig. 4a. Transmit connection 210 and receive connection 255 make up a 
request channel. On transmit connection 210, access unit 21 5 transmits a request 400 to 
device 205, where the request is processed. When device 205 has finished the processing of 
request 400, .device 205 transmits a response 405 to access point 200a via receive 
connection 255. Response 405 is received by response FIFO store 260, from ^ch 
response 405 can be fetched by access unit 215 or response substitution entity 270. 

A request channel, comprising a transmit connection 210 and a receive connection 255, is a 
logical channel which is preferably dedicated to the transmission of requests from an 
access point 200 to one device 205. If e.g. TDMA (Time Division Multiple Access) is' 
used, a request channel could advantageously utilise one or more pre-defined time slots. In 
another embodiment, each request channel could correspond to a physical channel in a 
point-to-point configuration. 

An example of a request 400 is illustrated in Fig. 4b. Request 400 of Fig. 4b comprises an 
operation field 410 for the representation of ihe operation that is requested of the device 
205. Predetermined codes could be used for the representation of requested operations, so 
that data field 410 comprises the code that has been agreed to represent the requested 
operations. Advantageously, the coding scheme used for the representation of operations in 
data field 302 of data processing information 300 could be used. Furthermore, request 400 
of Fig. 4b comprises a request data field 415, containing the data that should be used in the 
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requested process. Request 400 of Fig. 4b fortiier comprises an address field 420. in which 
an address to a memory in device 205 can be transmitted. Should device 205 not require 
such addressing, address field 420 can preferably be omitted. 

In Fig. 4c, an example of a response 405 is illustrated. Response 405 preferably comprises 
a request complete field 425. which can advantageously be implemented as a flag which 
takes one value if the request has been completed, and another value should the request 
have failed. Response 405 further comprises a response data field 430, in which response 
data can be transmitted. Obviously, request 400 and/or response 405 could comprise other 
data fields when necessary. 

In one embodiment of the invention, additional information 135 comprises information 
relating to which instructions m the subsequent processing stage 120 should be executed. 
This information will hereinafter be referred to as the program counter. Preferably, the 
program counter could be stored, in a processing stage 120, in the special purpose register 
165 of addiUonal register 155, i.e. the program counter could preferably be part of the 
special purpose register contents. Advantageously, the program counter could be altered 
based on the operations perfonned by a device 205. hi an embodiment vy^ere the program 
counter is not part of the general purpose register contents 230, this could be achieved by 
including a field in data processing mformation 300, referred to as the program counter 
location. The program counter location could indicate where a program counter value, 
included in response 405. is to be stored. The program counter location could e.g. point at a 
location m the general purpose register contents 230. By exchanging the program counter 
value in the special purpose register contents of additional mformation 135 witix the value 
stored in tiie location mdicated by flie program counter location after &e response data field 
430 of response 405 has been merged with tiie general purpose register contents 230, tiie 
program counter can take a value resulting firom tiie operations performed by device 205. 
This exchange can preferably be performed by tiie response substitution entity 270. 
Alternatively, tiie program counter value included in response 405 could be stored as tiie 
new program counter in tiie special purpose register contents of additional information 135 
straight away when tiie response 405 is merged witii additional information 135 and data 
block 130. Obviously, otiier parameters not included in tiie general purpose register 
contents 230 could be treated in a similar way. 
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The value of data processing infonnation 300 can in a typical implementation of the 
invention vary between a rather limited number of combinations of values of data fields. 
The number of allowed combinations of data fields in data processing infonnation 300 
could typically take a value of 10 or 20. Rather than including the entire data processing 
information 300 in access point reference 240, one could advantageously instead include a 
representation of flie combination of data fields forming the data processing uiformation 
300, such representation being referred to as a driver identifier. The driver identifier could 
preferably be generated by one or several processing stages 120, and included in the 
additional information 135. In Fig. 5a, a driver table 500 is shown comprising six different 
driver identifier entries, each comprising a possible value of driver identifier 505, and 
correspondmg data processing uiformation entries comprising data processing information 
300 which the driver identifier values represent. Each value of driver identifier 505 in the 
driver table 500 could preferably refer to a different value of data processing information 
300. Obvioiisly, a driver identifier 505 could represent a value of data processing 
information 300 formed of a combination of any number of data fields, and does not have 
to be limited to the four data fields shown in Fig. 5a. A driver table 500 can advantageously 
be stored at access point 200, as will be described fijrther in relation to Fig. 7. In Fig. 5b, an 
example of an access point reference 240 for an embodiment using driver identifiers 505 is 
illustrated. Access point reference 240 of Fig. 5b comprises a driver identifier 505 and no 
data processing information 300. 

When usmg a driver table 500 m access point 200 and mcluding a driver identifier 505 in 
access point reference 240, rather than the entire data processing information 300, the 
processmg resources in the processmg stages 120 that are used for obtaining access point 
reference 240 can be reduced. Furthennot«, an abstraction layer is introduced to the access 
point 200, so that the values of data processing information 300 can be altered without 
changing the operation of the processing stages 120. The data processing information 300 
corresponding to a value of driver identifier 505 in driver table 500 can be altered, 
preferably during configuration of the access point 200. Hence, the introduction of driver 
identifier tables 550 to access point 200 facilitates for the exchange of a device 205 to a 
different device 205, or for the change of operations for which a device 205 can be used by 
access point 200. 
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A device 205 could advantageoiisly be accessed from different access points 200 in a 
pipeline 110. Hie processing resources provided by a device 205 can hence be efficiently 
utilised, since a device 205 can be used at different stages in the pipeline 1 10. A device 205 
that can be accessed from more than one access point 200 could advantageously comprise a 
request FIFO for each access point 200 from which the device 205 can be accessed, in 
order to avoid that requests 400 from different access points are mixed up. A further 
advantage" of such a request FIFO is that the device 205 can operate at clock frequency 
diSerentto Ihe clock frequency of the pipeline 110, so that, e.g., a device 205 may 
comprise a pipeline operating at a different clock frequency than pipeline 1 10. 
Furthermore, different operations that can be requested by the same access point 200 from 
a device 205 can use different number of clock cycles. Hence, a request FIFO could 
advantageously be implemented also in a device 205 which can be accessed by only one 
access point 200. 

Figs. 6a-e iUjistrate the flow of a data packet 125, comprising two data blocks 130, through 
an access point 200 comprising a shift register 263, a data block FIFO store 245, an 
additional infonnation FIFO store 250 and a response FIFO store 260. Fig. 6a shows the 
first clock tick, upon which the first of the two data blocks 130 of data packet 125 enters 
the data block FIFO store 245. At the same tune, additional information 135 associated 
with data block 125 enters the additional information FIFO store 250, and a triggering 
value 265 is initiated in shift register 263. hi order to simplify the description, shift register 
263 is shown to have only four positions, although in most implementations, shift register 
263 would be longer. A request 400, not shown in the figure, is transmitted to device 205. 
Upon the next clock tick, as is illustrated in Fig. 6b, the second data block 130 of data 
packet 125 enters tiie a data block FIFO store 245, and the triggering value 265 is moved 
forwards m shift register 263. MeanvMe, the request 400 is processed by device 205. 
Upon tiie following clock tick, as is shown m Fig. 6c, a response 405 is fransmitted from 
device 205 to the response FIFO store 260. Triggering value 265 is moved forwards m shift 
register 263, and anotiier data packet, comprising only one data block, enters the access 
point Another triggering value is shifted mto shift register 263 in relation to this another 
data packet Upon the next clock tick, as is illustrated m Fig. 6d, the triggering value 265 
reaches tiie last position in shift register 263. Vpon tiie followmg clock tick, a triggering 
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signal 268 is sent from shift register 263, to access unit 215 and/or response substitution 
unit 270, not shovm in the figure. As is illustrated in Fig. 6e, the first data block 130 in data 
block FIFO store 245, the additional information 135 m additional information FIFO store 
250 and the response 405 in the response FIFO store 260 are fetched, either by access unit 
215 or substitution entity 270, depending on the implementation of the invention. The data 
packet 125, the additional information 135 and the response 405 can then be processed 
accordii^ to the implementation, preferably m a response substitution entity 270. As can be 
seen in Fig. 6e, anotiier response, corresponding to the another data packet, has been sent 
from device 205 to response FIFO store 260. 

In Fig. 7, an access pomt 200b is shown, providing access to two different devices 205 via 
two separate request channels made up of transmit comiections 210a and 210b and receive 
connections 255a and 255b. Access via transmit coimection 210a is handled by an access 
imit 215a, while transmit coimection 210b is handled by an access unit 215b. Transmit 
comiection 210a is comiected between access imit 215a and a switch 220, while transmit 
<;onnection ^lOb is connected between access unit 21 Sb and switch 220. Transmit 
connection 210a of Fig. 7 can be configured to be coraiected via switch 220 to either one of 
devices 205i, 205ii, or 205iii, while transmit connection 210b can be configured to be 
connected via switch 220 to either one of devices 205iv and 205v. In Fig. 7, switch 220 is 
shown to be configured so that transmit connection 210a is connected to device 205ii, 
while transmit connection 210b is comiected to device 205 v. Hence, device 205ii and 20Sv 
can be accessed from access point 200b via transmit connection 210a and 210b, 
respectively. In alternative embodiments, a transmit connection 210 can be configured to 
be connected to any of the devices 205 connected to switch 200. Any device 205 that can 
be accessed by any access point 200 of processor 100 coidd sxiitably, but not necessarily, be 
connected to the same switch 200. 

Access point 200b of Fig. 7 comprises two driver identifier tables 500a and 500b, relating 
to the access of devices 205 connected to access point 200b via transmit coimection 210a 
and 210b, respectively. When a data block 130 enters access point 200b, it is preferably 
accompanied by additional information 135 comprising general purpose register content 
230 and access point reference 240, the latter comprising driver identifiers 505a and 505b. 
In analogy to Fig. 2, general purpose register content 230 could preferably enter access 
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units 215a and 21 5b. Driver identifier 505a enters driver table 500a, in which the value of 
data procesang information 300 corresponding to the value of driver identifier 505a is 
identified. The data processing information 300a is then forwarded to access unit 215a, 
which uses data processing information 300a for determniing which information should be 

5 included in request 400a to be sent on transmit connection 2 i Oa (cf. Fig. 4). In a similar 
■ maimer, driver identifier 505b enters driver table 500b, in which the value of data 
processing information 300 corresponding to the value of driver identifier 505b is 
identified. The data processing information 300b is then forwarded to access unit 21 5b, 
which prepares request 400b, to be sent on transmit connection 210b, accordmg to data 

1 0 processing information 300b. 

In the embodiment of the invention shown in Fig. 7, general purpose register contents 230 
and the access point reference 240 comprismg the driver identifiers 505 are extracted by 
the processing stage 120i preceding access point 200b. However, access point 200b could 
1 5 alternatively have a request extraction entity 237 in which the general puipose register 
contents 230. and access point refetraice 240 could be extracted, cf. Fig. 2. 

When device 205ii has perfonned the operation requested of device 205ii. a response 405a 
is transmitted on receive connection 255a to response FIFO store 260a. SimUarly, when 

20 device 205v has performed the operation requested of device 205v, a response 405b is 

transmitted on receive connection 255b to response FIFO store 260b. Responses 405a and ' 
405b can then be forwarded to access unit 215a and access unit 215b, respectively, on 
demand. In order to guarantee that responses 405a and 405b wiU be associated to the 
correct data block 130 in access units 215a and 215b. a fixed time delay mechanism could 

25 be included in the access point 200b, the fixed time delay provided by the fixed time delay 
mechanism bemg adjusted to the number of clock ticks needed in order to complete the 
most demanding operation that could be performed by any of devices 205i-v. In fig. 7, the 
fixed time delay mechanism is shown to be a shift register 263. When a data block 130 
enters access point 200b, a triggering value 265 could be shifted into shift register 263. 

30 When triggering value 265 reaches the end of shift register 263, the operations required by 
the two devices 205 connected to transmit connections 210a and 210b will have been 
completed and the responses 405a and 405b wUl have been stored m response FIFO store 
260a and 260b, respectively. When triggering value 265 reaches the end of shift register 
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263, the triggering value 265 could suitably be sent as a triggering signal 268 to access 
units 215a and b, the triggering signal 268 indicating that the responses 405a and 405b 
relating to the first entries in data block FIFO store 245 and additional information FIFO 
store 250 are now stored in response FIFO stores 260a and 260b, respectively. Upon 
reception of the triggering signal 265, access units 215a and 215b can preferably fetch the 
first responses 405a and 405b in response FIFO stores 260a and 260b, and then forward 
said responses to the response substitution entity 270. Preferably, access units 215 also 
forward the part of data processing information relating to response handling, stored in the 
access unit FIFO, to the response substitution entity 270. Triggering signal 268 could be 
sent firom shift register 263 also to response substitution entity 270, which upon reception 
of the triggering signal 268 could fetch the first entries in data block FIFO store 245 and 
additional information FIFO store 250, respectively. Alternatively, response substitution 
entity 270 could fetch said first entries, corresponding to data block 130 and additional 
information 135, responsive to the reception of the first response 4p5a and 405b 
transmitted by access units 21 5a and 21 5b, or the triggering signal 268 could be transmitted 
to response substitution entity 270, die response substitution entity 270 fetching the first 
response of die relevant FIFO stores responsive to die reception of die triggering signal 
268. In the response substitiition entity 270, the responses 405a and 405b which have been 
fetched ftom the response FIFO stores 260a and 260b can substitute relevant parts of data 
block 130 and additional infoimation 135. This substitution should preferably be 
performed according to the information in the relevant part of the data processing 
infoimation 300a and 300b, which preferably could be transmitted fix)m access units 215a 
and 21 5b to response substitution entity 270. Should any part of the response 405a for 
some reason be indicated to be stored at tiie same location as a part of response 405b, 
precedence rules could be set up during configuring of tiie access point 200b. 

In Fig. 8, amethod of simultaneously accessing two devices 205i and 205ii fi:om an access 
point 200 according to tiie invention is Ulustrated- In step 800, a data block 130 and 
associated additional information 135 is received by access point 200. Data processing 
infoimation 300a and 300b. relevant to tiie access of devices 205i and 205ii, respectively, 
are tiien extracted. In an embodiment comprising driver identifier tables 500, this 
extraction is preferably performed by using driver identifiers 505a and 505b in additional 
information 135 to find the relevant data processing information 300a and 300b in driver 
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identifier tables 500a and 500b. The extraction of driver identifier 505a and 505b from 
additional information 135 could advantageously be performed in a request extraction 
entity 237. Alternatively, in an embodiment which does not comprise data processing 
information tables 500, the data processing information 300a and 300b could be extracted 
5 in the request extraction entity 237. In an implementation of access pomt 200 whicli does 
not comprise a request extraction entity 237, the extraction of access point reference 240 
could e.g. be performed by the processing stage 120 preceding access point 200. 

In step 810 of Fig. 8, tiie data block 130 and tiie additional information 135 are stored in a 
10 data block FIFO store 245 and an additional information FIFO store 250, respectively. Step 
815 is then entered, in v^ch a fixed time delay mechanism is initiated. The fixed time 
delay mechanism could altematively be initiated before step 805 or 810, or after step 820. 
The time delay provided by the fixed time delay mechanism could then, if necessary, be 
adjusted accordingly. 

In step 820, a request 400a, based on tiie information in data processing information 300a, 
is transmitted to device 205i, and a request 400b, based on tiie information in data 
processing information 300b, is tiansmitted to device 205ii. The request 400a and 400b are 
advantageously prepared and transmitted by access units 215a and 21 5b, respectively. In 

20 step 825, the elapse of the fixed time delay initiated by the fixed time delay mechanism is 
awaited. When the fixed time delay mitiated by the fixed time delay mechanism has 
elapsed, step 830 is entered, in -w^ich a triggering signal 268 is transmitted, indicating that 
responses 405a and 405b associated witii data block 130 are stored as the first entiies in 
response FIFO storey 260a and 260b, respectively. The transmit signal 268 can 

25 advantageously be t^smitted to tiie access units 215, and/or tiie response substitution 

entity 270. In step 835, the responses 405a and 405b, as weU as tiie data block 130 and the 
additional mfoimation 135, are fetched in respective FIFO stores by access units 21 5a and 
215b, or by response substitution entity 270. The relevant parts of responses 400a and 400b 
are tiien merged into tiie relevant positions in data block 130 and/or additional information 

30 135 in step 840, tiie merging preferably taking place in response substitution entity 270. In 
step 845, tiie data block 130 and additional information 135 resulting from step 840 are 
forwarded to tiie proceeding processing stage 120. The process then ends at stq? 850. 
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Fig. 9 is a flowchart illustrating a procedure performed, according to one embodiment of 
the invention, by an access unit 215 when a data block 130 has been received by access 
point 200. In step 900, data processing information 300 and general purpose register 
contents 230 associated with a data block 130. is received by access unit 215. Instep 905, 

5 access umt 215 prepares a request 405 according to data processing information 300 
received in step 900. The request 405 is then transmitted, m step 910, to device 205 on 
transmit connection 210. Step 913 is then entered, in which the part of data processing 
informatidn 300 relevant to the handling of the response 405 which vwll be received from 
device 205 is stored in the access unit FIFO store. In step 915, access unit 215 awaits a 

10 triggering signal 268. When the triggering signal 268 has been received, step 920 is 

entered, in which response 405 is fetched in response FIFO store 260. S\sp 925 is then 
entered, in which the relevant parts of the response 405, such as e.g. flie response data field 
430, is forwarded to response substitution entity 270, to be merged uato additional 
information 135 and/or data block 130 according to the data processing information 300 

15 received in step 900. Preferably, the first entry in access unit FIFO store could also be 
^rwarded to response substitution unit 270 in step 930. In step 930, the procedure is 
ended. 

The procedure of Fig. 9 could be modified in several ways. In one embodiment of the 
20 invention, the access unit 215 may be adi^ted to fetching the first entries m data block 
FIFO store 245 and in additional information FIFO store 250 upon reception of the 
triggering signal. The merging of the response 400 into data block 130 and/or additional 
information 1 35 could then be performed by access unit 21 5, or by substitution entity 270. 
Furthermore, in an embodiment where access point 200 provides access to several devices 
25 205, access unit 215 may check which response should take precedence if the data 

processing information 300 specifies that more than one respoi^e 405 should be stored at 
tiie same location in additional information 135 or data block 130, and then act 
accordingly. Alternatively, this check could be performed by response substitution entity 
270. The access unit 215 could fiirthermore preferably perform a check as to whether the 
30 requested operation has been successfially performed by device 205. If request complete 
field 425 of response 405 shows that the request has not been completed, the access unit 
21 5 could e.g. decide that data block 130 should be discarded 
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Access points 200a and 200b, shown in Fig. 2 and Fig. 7 respectively, could be varied in 
many ways. Obviously, an access point 200 could have any number of request channels 
providii^ simultaneous access to any number of devices 205. The synchionisation 
mechanism could be implemented as completion driven conditional logic, rather than as 
5 tile fixed time delay mechanism illustrated by Fig. 2 and 7. hi access point 200b, access 
vaAt 215a and 21 5b could be implemented as an integrated access unit 215. Request 
extraction entity 237 and/or response substitution entity 270 could then be implemented as 
part of access unit 215. Furthermore, driver tables 505a and 505b could be implemented as 
an integrated driver table 505, as long as the values available to driver identifier 505a are 
10 diffecent to the valxies available to driver identifier 505b. hi one embodiment of the 

invention, access unit 215 can be hnplemented as a request port and a separate response 
port Moreover, data block FIFO store 145 and additional mformation FIFO store 150 
coxild be implemented in the same FIFO store. 

15 An access point 200 can preferably be located anywhere in pipeline 1 1 0, as well as between 
ipput 105 and pipeline 1 10, or between pipelme 1 10 and output 115. If an access point 200 
is located before the pipeline 1 10, the access pomt reference 240 can obviously not be 
generated by a processing stage 120 of pipehne 1 10, but could e.g. be generated by input 
105, or be extracted from additional mformation 135 by a request extraction entity 237. A 

20 device 205, external to the pipeUne 1 10, could be implemented as a device mtemal to 

processor 100, located on the same chip as processor 100, or could be unplemented as an 
external device, located on a different chip. 

The processor 100 could be implemented as an integrated circuit (i.e. as an ASIC), as part 
25 of an integrated circuit, or as many integrated circuits connected to each other. 

The inventive processor 100 could advantageously be unplemented m any node in a data 
communication system, in which data packets are processed. Examples of such nodes are 
routers and telecommunication switches for packet data. A processor 100 could then be 
30 part of a computer unit, such as a network computer unit or a signal processmg computer 
unit. 
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One skilled in the art will appreciate that the present invention is not limited to the 
embodiments disclosed in the accompanying drawings and the foregoing detailed 
description, which are presented for purposes of illustration only, but it can be 
implemented in a number of different ways, and it is defined by the following claims. 



